Classifying User Messages For Managing Web Forum Data

Authors

  • Sumit Bhatia
  • Prakhar Biyani
  • Prasenjit Mitra
Abstract

Online discussion forums have become a popular medium for users to hold discussions with, and seek information from, other users who share their interests. A typical discussion thread consists of a sequence of posts written by multiple users. Not all posts in a thread are equally useful; they serve different purposes and provide different types of information (some posts contain questions, some contain answers, etc.). Identifying the purpose and nature of each post in a discussion thread is an interesting research problem, as it can help improve information extraction and intelligent assistance techniques [9]. We study the problem of classifying a given post according to its purpose in the discussion thread. We employ features based on the post’s content, the structure of the thread, the behavior of the participating users, and sentiment analysis of the post’s content. We achieve decent classification performance and also analyze the relative importance of the different features used for the post classification task.
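
The four feature groups named in the abstract (content, thread structure, user behavior, sentiment) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Post` type, the `extract_features` helper, the individual feature names, and the tiny word lists are hypothetical and are not taken from the paper, whose actual features and classifier are not described on this page.

```python
from dataclasses import dataclass

# Hypothetical mini-lexicons (assumption); the paper's sentiment resources are not given here.
POSITIVE = {"thanks", "great", "works", "solved"}
NEGATIVE = {"error", "broken", "fails", "problem"}


@dataclass
class Post:
    author: str
    text: str


def extract_features(thread, index):
    """Return an illustrative feature dict for the post at thread[index]."""
    post = thread[index]
    # Lowercase tokens with surrounding punctuation stripped.
    words = [w.strip(".,!?").lower() for w in post.text.split()]
    earlier_authors = [p.author for p in thread[:index]]
    return {
        # Content of the post
        "num_words": len(words),
        "has_question_mark": "?" in post.text,
        # Structure of the thread: 0.0 for the first post, 1.0 for the last
        "relative_position": index / max(len(thread) - 1, 1),
        # Behavior of the participating user
        "is_thread_starter": post.author == thread[0].author,
        "prior_posts_by_author": earlier_authors.count(post.author),
        # Sentiment: simple lexicon hit counts
        "positive_words": sum(w in POSITIVE for w in words),
        "negative_words": sum(w in NEGATIVE for w in words),
    }


thread = [
    Post("alice", "My build fails with a linker error. Any ideas?"),
    Post("bob", "Try cleaning the cache first."),
    Post("alice", "Thanks, that solved it!"),
]
features = [extract_features(thread, i) for i in range(len(thread))]
```

Feature dictionaries of this kind could then be passed to any standard supervised classifier together with labeled post types (for example question, answer, or acknowledgement; these label names are likewise assumptions).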

Similar resources

A Technique for Improving Web Mining using Enhanced Genetic Algorithm

The World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines use conventional methods to retrieve information on the Web; however, their search results can still be refined, and their accuracy is not high enough. One family of methods for web mining is evolutionary algorithms, which search according to the user interests...

A General Framework for Building Applications with Short and Sparse Documents

With the explosion of e-commerce and online communication and publishing, texts have become available in a variety of genres, such as Web search snippets, forum and chat messages, blogs, book and movie summaries, product descriptions, and customer reviews. Successfully processing them therefore becomes increasingly important in many Web applications. However, matching, classifying, and clustering thes...

A Recommender System Approach for Classifying User Navigation Patterns Using Longest Common Subsequence Algorithm

Predicting users’ future movements and intentions from their clickstream data is a major challenge in Web-based recommendation systems. Web usage mining based on the users’ clickstream data has become the subject of exhaustive research, as its potential for web based personalized services, predicting user near future intentions, adaptive Web sites and customer profiling is re...

Classifying User Forum Participants: Separating the Gurus from the Hacks, and Other Tales of the Internet

This paper introduces a novel user classification task in the context of web user forums. We present a definition of four basic user characteristics and an annotated dataset. We outline a series of approaches for predicting user characteristics, utilising aggregated post features and user/thread network analysis in a supervised learning context. Using the proposed feature sets, we achieve resul...

Web pages ranking algorithm based on reinforcement learning and user feedback

The main challenge of a search engine is ranking web documents to provide the best response to a user's query. Despite the huge number of results extracted for a user's query, only a small number of the first results are examined by users; therefore, placing the relevant results in the top ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...

Publication date: 2012